Language Models
12 Threads
Permit me to pique your interest: Self-Taught Optimizer (STOP). This paper reveals a powerful new capability of large language models - the ability to recursively improve how they...
Do language models have an internal world model? A sense of time? At multiple spatiotemporal scales? In a new paper with @tegmark we provide evidence that they do by finding a lit...
A list of some of the most popular LLMs and LLM datasets: Bard (Google AI), ChatGPT (OpenAI), Claude (Anthropic), Claude...
5 Advanced ChatGPT prompt techniques that will put you ahead of the world: (🧵 A thread)
Nearly all recently proposed large language models (LLMs) are based upon the decoder-only transformer architecture. But is this always the best architecture to use? It depends… 🧵...
There are lots of threads like “THE 10 best prompts for ChatGPT.” This is not one of those. Prompt engineering is evolving beyond simple ideas like few-shot learning and CoT reason...
Each “block” of a large language model (LLM) comprises self-attention and a feed-forward transformation. However, the exact self-attention variant used by LLMs is masked, mul...
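A minimal sketch of the block structure that teaser describes: masked (causal) multi-head self-attention followed by a feed-forward transformation. The layer sizes, norm placement, and class name below are illustrative assumptions, not taken from the thread or from any particular LLM.

```python
import torch
import torch.nn as nn

class DecoderBlock(nn.Module):
    """One illustrative LLM block: masked self-attention + feed-forward."""
    def __init__(self, d_model=512, n_heads=8, d_ff=2048):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ff = nn.Sequential(
            nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model)
        )
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x):
        # Causal mask: each position may attend only to itself and earlier tokens.
        seq_len = x.size(1)
        causal_mask = torch.triu(
            torch.ones(seq_len, seq_len, dtype=torch.bool), diagonal=1
        )
        attn_out, _ = self.attn(x, x, x, attn_mask=causal_mask)
        x = self.norm1(x + attn_out)       # residual connection + layer norm
        x = self.norm2(x + self.ff(x))     # feed-forward, residual + layer norm
        return x
```

Real LLMs stack many such blocks and vary details (pre- vs post-norm, attention variants, activation functions); this is only the skeleton the thread refers to.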
ChatGPT is all the rage. But what does GPT actually mean? Here’s a quick breakdown so you don’t get left behind:
1/ In 2021, we shared next-gen language + conversation capabilities powered by our Language Model for Dialogue Applications (LaMDA). Coming soon: Bard, a new experimental conversat...
Three years in the making - our big review/position piece on the capabilities of large language models (LLMs) from the cognitive science perspective. Thread below! 1/ https://t....
In text generation, how do you get a large language model to be more (or less) creative? 🎨 Depending on your use case, you may want the model to be: 1. Very creative, or 2. Very p...
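The most common knob behind that question is sampling temperature: higher values flatten the next-token distribution (more creative), lower values sharpen it (more predictable). The sketch below uses made-up logits and a hypothetical `sample_next_token` helper purely to illustrate the idea.

```python
import numpy as np

def sample_next_token(logits, temperature=1.0, rng=np.random.default_rng()):
    """Sample a token id from raw logits after temperature scaling."""
    scaled = np.asarray(logits, dtype=np.float64) / temperature
    probs = np.exp(scaled - scaled.max())   # numerically stable softmax
    probs /= probs.sum()
    return rng.choice(len(probs), p=probs)

logits = [2.0, 1.0, 0.5, -1.0]              # hypothetical next-token scores
print(sample_next_token(logits, temperature=0.2))  # low temp: almost always the top token
print(sample_next_token(logits, temperature=1.5))  # high temp: more varied choices
```

Hosted APIs and libraries expose the same idea as a `temperature` parameter (often alongside top-p/nucleus sampling) rather than requiring you to scale logits yourself.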
@schachin @cemper ? They are based on language models derived from corpora of content. Things like Wiki (factual), news (factual/perspective), etc. Break away from those, and the...